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METHOD AND APPARATUS TO PERFORM MODULATION USING 
INTEGER TIMING RELATIONSHIPS BETWEEN INTRA SYMBOL 
MODULATION COMPONENTS 

BACKGROUND 

Field of the Invention 

[0001] Embodiments of the invention relate to modulation. More 

specifically, embodiments of the invention relate to an improved encoding 
density modulation scheme. 
Description Of The Related Art 

[0002] Various forms of modulation have long been used to encode 

data with greater efficiency so that more data can be transmitted during a 
particular time period over a transmission medium. Combinations of various 
modulation techniques such as, pulse width modulation, amplitude 
modulation and rise time modulation have been employed to improve the 
encoding density of modulation schemes. See for example, copending 
application entitled "Symbol-Based Signaling For An Electromagnetically- 
Coupled Bus System/' Serial Number 09/714,244. However, such schemes 
typically employ a fixed base pulse residing in a fixed location within the 
symbol period. This can have a significant limiting effect on the possible 
modulation symbols available and therefore the modulation gain achievable. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0003] Embodiments of the invention are illustrated by way of example 

and not by way of limitation in the figures of the accompanying drawings in 
which like references indicate similar elements. It should be noted that 
different references to "an" or "one" embodiment in this disclosure are not 
necessarily to the same embodiment, and such references mean at least one. 
[0004] Figure 1 is a spreadsheet of a symbol set according to one 

embodiment of the invention. 

[0005] Figure 2 is a spreadsheet showing piecewise linear (PWL) 

depictions of some symbols according to one embodiment of the invention. 
[0006] Figure 3 is a spreadsheet of symbols having the same 

parameters as used for the symbols of Figure 2. 
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[0007] Figure 4 is a block diagram of a modulator of one embodiment 

of the invention. 

[0008] Figure 4A is a block diagram of the AM modulator of Figure 4. 

[0009] Figure 5 is a block diagram of a demodulator of one 

embodiment of the invention. 

[0010] Figure 5A is a more detailed block diagram of the slot in edge 

group detector of Figure 5. 

[0011] Figure 5B shows the slot to edge group converter of Figure 5. 

[0012] Figure 5C is a more detailed block diagram of the AM 

demodulator of Figure 5. 

[0013] Figure 6 is a spreadsheet showing state assignments for the 

embodiment of the modulator and demodulators shown in Figures 4 and 5. 
[0014] Figure 7 is a block diagram of the system incorporating one 

embodiment of the invention. 

DETAILED DESCRIPTION 
[0015] One embodiment of the invention provides an improved 

modulation technique premised on multiples of a fundamental unit of time 
(FTU). Selection of the FTU size may be based on the minimum phase slot 
size for the system. In equalized channels, jitter and other noise factors over 
manufacturing variations become the limiting factor on slot size that can be 
resolved for a given bit error rate (BER). In this scheme, data is encoded in 
symbols in relation to the timing relationship of modulation objects within the 
symbol. In an embodiment, a symbol period ("S") is an integer ("N") 
multiple of the FTU. The symbol period may be selected in accordance with 
latency, complexity, and timing resolution possible with the timing 
constraints in the system employed. The delay lock loops (DLL), phase lock 
loops (PLL), delay elements, process employed, design of RF channel, 
manufacturing variations, and feature size all effect the desired symbol 
period. 

[0016] In this description a phase slot for phase modulation within the 

symbol period is abbreviated as "s." The examples below have s = 1 FTU, in 
some embodiments, variable phase slots may be constructed from any integer 
multiple of the FTU, where the FTU in this case must equal or exceed the 
minimum time increment resolvable in the semiconductor process. The time 



42P15169 



2 Express Mail No. EV339910561US 



during which single transition in amplitude may occur between edges plus 
the minimum time space during which a new edge transition is not permitted 
in a communication channel to control intersymbol interference (ISI) is 
abbreviated as TV The set of {TJ for i = 1, 2,. . .n is defined as the set of 
different widths required for general edge rate and amplitude modulation 
that include the ISI spacer. The T { for each member of the set is an integer 
multiple of the FTU. In one embodiment, the integer multiple and other 
modulation parameters are chosen based on optimizing the bit error rate 
(BER) performance in an ISI communication channel in tandem with 
optimizing the channel equalization. 

[0017] In chip to chip interconnect signaling, the amplitude is 

frequently permitted to have two (binary), three pulse-amplitude modulation 
(3 PAM) or four pulse amplitude modulation (4 PAM) possible levels. For the 
case of three amplitude levels, ground, V dd and V ref (V dd /2) are reasonable 
implementation choices. With three possible amplitude levels, two 
transitions are possible at each edge. This permits one bit of amplitude 
modulation to be encoded per edge. It is of course contemplated that in 
addition to amplitude modulation, rise time modulation is supported by the 
modulation scheme described below. Moreover, other embodiments may use 
an arbitrary number of amplitude levels and/or rise times. The number of 
possible transitions per edge is abbreviated as "B" in this description. Thus, 
for three amplitude levels, B = 2. The total symbol set for a set of parameters 
{Tj}, B and S is formed by choosing all combinations of the set {TJ that fit 
within S. Symbol mapping logic may be used to select a subset of total 
available symbols based on the ISI and the other properties of symbols 
including the symbols needed to form an integer modulation factor across the 
small number of input/output (I/O) pads. Stated differently, since as 
discussed below, the modulation is log base two of the total number of states, 
some symbols may be unused because the log base two of the total status 
must be an integer. However, where a plurality of modulators (M) are 
employed, e.g. M > 2, it is possible to modulate fractional bits across plural 
modulators. For example, if M = 2 and each modulator could modulate 4 1/2 
bit, the additional half bit can be modulated by both M 1 and M 2 to yield nine 
bits of total modulations. 
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[0018] Figure 1 is a spreadsheet of a symbol set according to one 

embodiment of the invention. In this example, FTU is equal to s is equal to 
120 pico seconds and T is equal to 3 FTU. Bearing in mind that T is equal to 
the sum of the transition region (rise time) plus the time during which no 
transition may occur. For this example, B is equal to 2, which implies two 
possible transitions from the current amplitude state which further requires a 
minimum of three possible amplitude levels. S is equal to 12 FTU and each 
column equals one FTU. Thus, in this example, the symbol period is equal to 
twelve phase slots. All the possible symbols may be derived by placing the 
modulation object T in all the possible places it may occur within the symbol. 
For this 3 FTU "T" example, the modulation object T is defined as tss (where t 
signifies a transition which may in theory occupy any fraction of the 3 phase 
slot wide "T".) in consecutive cells along a row in a spreadsheet, "p" notation 
serves as a wildcard to represent multiple "T" as follows: the sequence of one 
or more p's may be paired with one t. This is defined to be equivalent to all 
possible combinations of t substituted for the p's one at a time with the 
remaining p's replaced by s'. For example, pppt is equal to (ssst, ssts, stss and 
tsss) and tppp is equal to pppt. 

[0019] The first row of the spreadsheet shows the symbol in which no 

transitions occur. In one embodiment of the invention, this state is unused. 
As can to be seen in the spreadsheet of Figure 1, 0 to 4 edge transitions are 
possible within the symbol period. This is found by dividing S by T. The 
column headed number of t's indicates the number of transitions within the 
symbol. The number of p's indicates the number of wild cards for t present in 
the symbol. The number of p states is equal to the number of p's plus 1. The 
number of total states for row is given by B raised to the power of the number 
of transitions (t's) times the number of p states. For example, for the second 
row, B 1 x 10 = 2 1 x 10 = 20. Similarly, for the third row, B 2 x 7 = 4 x7 = 28 and 
so forth. This large number of states is in part a result of permitting the 
modulation objects to overlap the positions of other modulation objects 
within different symbols. Stated differently, because the modulation object 
can occupy any position within a symbol period, except slots occupied by 
other modulation objects of the same symbol, higher modulation efficiency is 
achieved. 
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[0020] The total number of states for all rows aggregated less the 

unused state reflected in the first row is 308 total states. The total modulation 
is given by log base two of the total states. Since the input to a set of 
modulators must be an integer, effective modulation is truncated to 8. Thus, 
for this example, there would be a number of unused symbols from the 
possible symbol set. While it is possible for any set of T, B, and S to 
graphically represent the possible symbol set and do a spreadsheet 
calculation, this is inefficient timing consuming and error prone. The 
following close form solution yields the total states for symbols formed from 
a single modulation object and was mathematically derived from the above 
counting rules. 

Fix(S/T) n 

Total states (TO = S Bn S C (n,m) * (S - n * T + l) m 

n=l m = l 

Where C(n,m) is a two dimensional matrix of coefficients that was derived 
from the coefficients for integer sums of the respective powers as follows: 

D 

first, note that the integer sum for "i" from zero to "n" to the "p" power, ^ i p , 

i=0 

may be expressed as a polynomial of order "p+1". The coefficients for these 
integer sum polynomials may be found in the literature. Next, counting rules 
were developed corresponding to the notation employed in counting the 
symbols in the spreadsheet examples of figures 1 and figure 3. Finally, the 
resulting counting formulas contains nested sums of the form: 

x i j y 

i =0 j=0 k=0 z=0 

These nested loops were unfolded to form a polynomial of order equal 
to the maximum number of edges in a modulated symbol with resulting 
C(n,m) coefficients, "n" corresponds to the number of transitions that are 
being counted by the inner loop; "m" sums the polynomial for a fixed number 
of transitions. The argument of the polynomial, (S - n * T + 1), reflects the 
number of slots remaining after subtracting the slots required to fit "n" 
transitions. Finally, it should be noted that the sum of C(n,m) coefficients 
must be equal to one for any given polynomial. 

[0021] Phase and amplitude modulation are modeled in this equation, 

in which B is the number of transitions possible at an edge. Thus, B becomes 
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the base for the exponential relationship per edge for the sum of amplitude 
and rise time modulation. For example, for two possible state transitions, B = 
2. The outer summation computes the amplitude state multiplier for the 
number of edges "n" from 1 (zero case not used) to the maximum number 
"fix(S/T)" of edges that will fit in the symbol. The amplitude state multiplier 
is then multiplied by the total number of phase states associated with V 
edges as computed by the inner summation to form the total number of 
states. This closed form solution allows optimization of parameters for 
systems using a single type of modulation object. 

[0022] Figure 2 is a spreadsheet showing piecewise linear (PWL) 

depictions of some symbols according to one embodiment of the invention. 
In a depicted example, the set {TJ = {T lr T 2 } (i.e., rise time modulation objects 
may be used to form the symbols). Again in this example, the phase slot size 
s = 1 FTU. For drawing convenience, the rise time of Tj is drawn as 1 FTU and 
the rise time of T 2 is drawn as 2 FTU. In other embodiments, modulation 
objects may have other resolvable rise time values. The modulation in the 
drawing depicts three amplitude levels. To avoid double counting, each 
symbol starts at V dd /2 amplitude. However, symbols could start at a ground 
or V dd amplitude as determined by the preceding symbol. Following the 
same nomenclature as Figure 1, T x = 4 FTU's (shorter rise time), T 2 = 5 FTU's 
(longer rise time), B x = 2, B 2 = 2 and S = 16. An FTU is again assumed to be 120 
pico seconds. T x is defined as tjsss and T 2 is defined t 2 ssss. These symbols 
illustrate symbols having two kinds of modulation objects within the symbol. 
The total number of states for a two modulation object symbol set is given by 
the equation: 

Total States (T a T 2 ) = 

Fix(S/T,) Fix((S-T,)/T 2 ) n, +n 2 

£ X B . n,B 2 n2 ((n.+n2)!/(n,!n 2 !)) £C((n, + n 2 ),m) * (S - n,T, - n 2 T 2 + l) m 

ni=l n 2 =1 m=l 

Where C((n 1 +n 2 ),m) is the two dimensional matrix of coefficients for the 
nested collapsing sums formula that was derived from the coefficient for 
straight sum of powers as discussed above in relation to Figure 1. Where "nl" 
corresponds to the number of "T/' modulation components and "n2" 
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corresponds to the number of "T 2 " modulation components. "B/' and "B 2 " are 
the number of possible transitions per edge for "T/' and "T 2 " respectively. 
[0023] The mixed term gain, the gain attributable to symbols 

containing two different rise times, increases the modulation efficiency in one 
embodiment of the invention. This close form solution permits optimization 
of parameters where two types of modulation objects are supported e.g., 
modulation objects having different rise times or occupying a different 
numbers of slots. This formula may be naturally extended to support any 
number of different sized modulation objects. 

[0024] Figure 3 is a spreadsheet of symbols having the same 

parameters as used for the symbols of Figure 2. In fact, Figure 2 corresponds 
to row number 58 on Figure 3 (the 5 th "T1T2T2" symbol row). The clocking 
scheme in one embodiment of the invention is depicted above the symbol 
columns. This spreadsheet reflects 1678 total states which is in exact 
agreement with two modulation object total states equation above. 
[0025] Figure 4 is a block diagram of a modulator of one embodiment 

of the invention. Modulator 400 may implement, for example, modulation 
using symbols of the format depicted in Figure 3. Modulator 400 includes 
phase modulation (PM) symbol table 402, and amplitude modulation (AM) 
symbol table 404, which map an incoming data stream to symbols such as 
those depicted in Figure 3. In one embodiment, the modulator and 
demodulator designs have an orthogonal architecture and circuit 
implementations of the AM and PM. The orthogonal architecture leads to 
parallel paths rather than a deepening of the critical path as would be the case 
where the AM and PM were interdependent. Similarly, in embodiments 
having rise time components, the rise time modulation (RTM) may have an 
orthogonal architecture to the AM and PM resulting in parallel processing of 
this component as well. A clock labeled "2x" is supplied to an edge detection 
unit 414. Delay units 410 each provided a delay equal to one phase slot to 
permit generation of a pulse at the appropriate distance from the appropriate 
reference clock edges as depicted at the top of Figure 3. The multiport switch 
timing generator, 408, generates control signals to insert the phase timing by a 
control multiplexer that selects the correct pulse for the phase modulation 
timing for the amplitude modulator 406. Note that sufficient delay should be 
provided to allow the control signals to settle before the PM timing edges 
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pass through the multiplexer. The correct AM voltage level is determined by 
control signals from AM symbol table 404 in conjunction with the prior AM 
state information contained within AM modulator 406. This is described in 
more detail with reference to Figure 4A. Both the PM symbol table 402 and 
the AM symbol table 404, change control signals every third slot edge group 
since the clock edge changes is a function of the edge input into the block, 
thereby efficiently implementing the entire symbol timing by repeatedly 
reusing the hardware for only one edge group. In one embodiment, the 
detailed timing within the edge groups is entirely generated by delay 
elements calibrated to the process using a DLL or similar technique. 
[0026] Modulator 400 also includes a forwarded clock circuit 416 to 

mimic the delay in the remainder of the modulator and provide a forwarded 
clock with timing consistent with the output of the modulator. In one 
embodiment, the forwarded clock has the edge slot assignments shown in 
Figure 3. The forwarded clock solution minimizes the effect of jitter on the 
transmitted signal. Alternative embodiment has no forwarded clock, but 
rather uses an embedded clock signal recovered at the demodulator with a 
clock recovery circuit. In another embodiment, a globally distributed clock 
may be used. 

[0027] Figure 4A is a block diagram of the AM modulator of Figure 4. 

The AM modulator 406 includes the level determination logic block 432 a 
synchronization unit 434, and transmit driver 436. A one slot delay 410 is 
introduced to maintain timing consistency with the clock symbol coming 
from the phase modulation portion of the modulator 400. The logic level 
determination unit 432 determines the AM level based on the signal from the 
AM symbol table and the prior state of the AM level as feedback from 
synchronization unit 434. The transmit driver 436 ultimately asserts the 
signal contingent on the symbols onto the bus with correct phase modulation 
and amplitude modulation. 

[0028] Figure 5 is a block diagram of a demodulator of one 

embodiment of the invention. Demodulator 500 includes a first and second 
edge-detect circuits 506 which receive the forwarded clock and data. In one 
embodiment, each edge detector 506 generates a slot sized pulse. The 
generated slot size pulse simplifies the remainder of the design and 
eliminates the need for reset circuits in the circuits downstream. The slot 
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sized pulses are provided as an input to the clock deskew circuit 510. The 
output of the deskew circuit connects to the input of the slot in edge group 
detector 512 (described in more detail with reference to Figure 5 A below). In 
one embodiment, the clock deskew circuit 510 inserts delays that compensate 
for different routing delays between clock and data and compensates for 
circuit variations to individually optimize the phase and amplitude 
demodulation eyes. The slot in edge group detector provides the stream of 
per slot edge information (discussed below) to the slot to edge group 
converter 415. Incoming data is also fed into an AM demodulator 516 
(described in more detail with reference to Figure 5C below). The AM 
demodulator 516 also receives an eye strobe signal from the slot in edge 
group detector 512. In other possible embodiments, the eye strobe signal 
could be directly derived from the CLK and DATA_IN signals to further 
decouple the AM and PM demodulators. The slot in edge group detector 512 
drives the slot to edge group converter 514 (described in more detail with 
reference to Figure 5B). The slot to edge group converter 514 assembles four 
edge group values and indexes into a PM demapping table. The slot to edge 
group converter 514 provides the edge group to demapping logic 520, which 
includes an edge group symbol converter 522 and a symbol to 8 bits converter 
524. The PM converted symbol is supplied to an AM PM combiner 530, 
which also receives the AM offset from AM demodulator 516. Frame 
synchronization unit 518 supplies the clock to the register 532 used to 
synchronize the output of the demodulator based on the clock from the 
synchronization unit 518. 

[0029] Figure 5A is a more detailed block diagram of the slot in edge 

group detector of Figure 5. The slot in edge group detector compares data 
pulses with appropriately slot delayed clock pulses to determine the slots in 
which phase changes occur. A plurality of slot delays and arbiters are used 
for this purpose. The information is streamed from the slot in edge group 
detector 512 on a per slot basis and is then reassembled by the slot to edge 
group converter 514. 

[0030] Figure 5B shows the slot to edge group converter of Figure 5. 

The slot to edge group converter 514 uses delay elements 560 and 562 to time 
align the slot edge detection data provided by slot in edge group detector 512 
to determine where within the edge group the phase transition occurred. In 
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this example, four edge groups form a symbol. The edge group assembled by 
the slot to edge group converter 514 is provided to the edge group symbol 
converter 522 as described above in connection with Figure 5. 
[0031] Figure 5C is a more detailed block diagram of the AM 

demodulator of Figure 5. The AM demodulator 516 samples the level at the 
end of each edge group and determines whether a change in level has 
occurred. Transition detect logic 550 detects these transitions. The change 
indexes a table to determine whether the change was up or down for this 3 
AM state example. The four possible transitions per edge group are then 
assembled into a symbol group using the slot to edge converter described 
above. The edge group symbol converter 554 indexes the edge group into 
edge group offset converter which includes an AM symbol demapping table 
to determine the AM value originally set by the modulator AM symbol table 
of the modulator (400 of Figure 4). 

[00321 The various circuits shown block diagrammatically in Figure 4- 

5C used delays, multiplexing, gates, drivers, arbitrators, and operational 
amplifiers. Some embodiments of the invention may employ additional 
multiplexing to save silicon area/cost. In one embodiment, all gates, 
multiplexers and control logic are implemented with near minimum sized 
transistors. This will result in only a nominal increase in the size of the 
transceiver relative to a binary transceiver. 

[0033] Figure 6 is a spreadsheet showing state assignments for one 

embodiment of the modulator and demodulators shown in Figures 4 and 5. 
[0034] Figure 7 is a block diagram of the system incorporating one 

embodiment of the invention. The processor 600 includes a modulator 400 
and demodulator 500. The processor 600 is coupled to a chipset 602 (which 
also contains the same modulators and demodulators), which is coupled to a 
memory bus 612 and an I/O bus 610. The chip set includes a memory 
controller 614, which also includes a modulator 400 and a demodulator 500. 
The improved modulation efficiency provided by modulator 400 and 
demodulator 500 ameliorates bandwidth bottlenecks between the processor 
600 and memory subsystems. It may similarly relieve bottleneck between 
chip set 602 and I/O devices. 

[0035] The memory controller interacts with the memory 604 over 

memory bus 612 via repeater 616. In one embodiment, repeater 616 is a low 
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latency regenerative repeater. Using the above-described modulation 
scheme, since amplitude transitions occur in known FTU time slots, they may 
be regenerated with a minimum regeneration processing delay. Conversely, 
if an asynchronous modulation scheme were used complete demodulation 
and remodulation would be required to regenerate the symbols. Such low 
latency regenerative repeaters 616 may be used where signals travel long 
distances or, for example, in point-to-point connected buffered dynamic inline 
memory modules (DIMMs) in a memory subsystem where pin count and cost 
on the memory controller require cascade rather than replicate busses. 
"Long" as used in this context is when the channel loss would cause the BER 
to exceed the specification without regeneration. 

[0036] An I/O device 606, which also contains a modulator 400 and a 

demodulator 500, is coupled to I/O bus 610 and may receive symbols 
modulated as previously described such that positions of modulation objects 
may overlap between symbol of the symbol set. The I/O device may include, 
for example, a disk controller. 

[0037] In another embodiment, the memory controller is embedded in 

the processor. Such an embodiment may or may not have a chip set, but in 
any event, the memory interface would need to have the corresponding 
modulator /demodulator to gain a benefit of the described modulation 
technique during memory accesses. 

[0038] In the foregoing specification, the invention has been described 

with reference to specific embodiments thereof. It will, however, be evident 
that various modifications and changes can be made thereto without 
departing from the broader spirit and scope of the invention as set forth in the 
appended claims. The specification and drawings are, accordingly, to be 
regarded in an illustrative rather than a restrictive sense. 
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